An Effective Combination Based on Class-Wise Expertise of Diverse Classifiers for Predictive Toxicology Data Mining

نویسندگان

  • Daniel Neagu
  • Gongde Guo
  • Shanshan Wang
چکیده

This paper presents a study on the combination of different classifiers for toxicity prediction. Two combination operators for the Multiple-Classifier System definition are also proposed. The classification methods used to generate classifiers for combination are chosen in terms of their representability and diversity and include the Instance-based Learning algorithm (IBL), Decision Tree learning algorithm (DT), Repeated Incremental Pruning to Produce Error Reduction (RIPPER), Multi-Layer Perceptrons (MLPs) and Support Vector Machine (SVM). An effective approach of combining classwise expertise of diverse classifiers has been proposed and evaluated on seven toxicity data sets. The experimental results show that the performance of the combined classifier over seven data sets can achieve 69.24% classification accuracy on average, which is 11.12% better than that of the best classifier (generated by MLP), among five classification methods studied.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of Chemical Named Entity Recognition through Sentence-based Random Under-sampling and Classifier Combination

Chemical Named Entity Recognition (NER) is the basic step for consequent information extraction tasks such as named entity resolution, drug-drug interaction discovery, extraction of the names of the molecules and their properties. Improvement in the performance of such systems may affects the quality of the subsequent tasks. Chemical text from which data for named entity recognition is extracte...

متن کامل

Extracting Predictor Variables to Construct Breast Cancer Survivability Model with Class Imbalance Problem

Application of data mining methods as a decision support system has a great benefit to predict survival of new patients. It also has a great potential for health researchers to investigate the relationship between risk factors and cancer survival. But due to the imbalanced nature of datasets associated with breast cancer survival, the accuracy of survival prognosis models is a challenging issue...

متن کامل

Predicting Implantation Outcome of In Vitro Fertilization and Intracytoplasmic Sperm Injection Using Data Mining Techniques

Objective The main purpose of this article is to choose the best predictive model for IVF/ICSI classification and to calculate the probability of IVF/ICSI success for each couple using Artificial intelligence. Also, we aimed to find the most effective factors for prediction of ART success in infertile couples. MaterialsAndMethods In this cross-sectional study, the data of 486 patients are colle...

متن کامل

Expert Discovery: A web mining approach

Expert discovery is a quest in search of finding an answer to a question: “Who is the best expert of a specific subject in a particular domain within peculiar array of parameters?” Expert with domain knowledge in any field is crucial for consulting in industry, academia and scientific community. Aim of this study is to address the issues for expert-finding task in real-world community. Collabor...

متن کامل

Class-wise multi-classifier combination based on Dempster-Shafer theory

Multi-classifier combination based on Dempster-Shafer theory of evidence has demonstrated it’s superior performance. In the approach based on Dempster-Shafer theory, the basic probability assignments for evidence are usually derived from classifiers’ global performance. However, our study discovered that while using classifiers’ global performance as basic probability assignments doesn’t necess...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006